智能论文笔记

Sentence-level Feedback Generation for English Language Learners: Does Data Augmentation Help?

Shabnam Behzad , Amir Zeldes , Nathan Schneider

分类：自然语言处理

2022-12-18

In this paper, we present strong baselines for the task of Feedback Comment Generation for Writing Learning. Given a sentence and an error span, the task is to generate a feedback comment explaining the error. Sentences and feedback comments are both in English. We experiment with LLMs and also create multiple pseudo datasets for the task, investigating how it affects the performance of our system. We present our results for the task along with extensive analysis of the generated comments with the aim of aiding future studies in feedback comment generation for English language learners.

translated by 谷歌翻译

Linear Combinatorial Semi-Bandit with Causally Related Rewards

Behzad Nourani-Koliji , Saeed Ghoorchian , Setareh Maghsudi

分类：机器学习

2022-12-25

In a sequential decision-making problem, having a structural dependency amongst the reward distributions associated with the arms makes it challenging to identify a subset of alternatives that guarantees the optimal collective outcome. Thus, besides individual actions' reward, learning the causal relations is essential to improve the decision-making strategy. To solve the two-fold learning problem described above, we develop the 'combinatorial semi-bandit framework with causally related rewards', where we model the causal relations by a directed graph in a stationary structural equation model. The nodal observation in the graph signal comprises the corresponding base arm's instantaneous reward and an additional term resulting from the causal influences of other base arms' rewards. The objective is to maximize the long-term average payoff, which is a linear function of the base arms' rewards and depends strongly on the network topology. To achieve this objective, we propose a policy that determines the causal relations by learning the network's topology and simultaneously exploits this knowledge to optimize the decision-making process. We establish a sublinear regret bound for the proposed algorithm. Numerical experiments using synthetic and real-world datasets demonstrate the superior performance of our proposed method compared to several benchmarks.

translated by 谷歌翻译

Efficient and Sound Differentiable Programming in a Functional Array-Processing Language

Amir Shaikhha , Mathieu Huot , Shabnam Ghasemirad , Andrew Fitzgibbon , Simon Peyton Jones , Dimitrios Vytiniotis

分类：机器学习

2022-12-20

Automatic differentiation (AD) is a technique for computing the derivative of a function represented by a program. This technique is considered as the de-facto standard for computing the differentiation in many machine learning and optimisation software tools. Despite the practicality of this technique, the performance of the differentiated programs, especially for functional languages and in the presence of vectors, is suboptimal. We present an AD system for a higher-order functional array-processing language. The core functional language underlying this system simultaneously supports both source-to-source forward-mode AD and global optimisations such as loop transformations. In combination, gradient computation with forward-mode AD can be as efficient as reverse mode, and the Jacobian matrices required for numerical algorithms such as Gauss-Newton and Levenberg-Marquardt can be efficiently computed.

translated by 谷歌翻译

NEON: Enabling Efficient Support for Nonlinear Operations in Resistive RAM-based Neural Network Accelerators

Aditya Manglik , Minesh Patel , Haiyu Mao , Behzad Salami , Jisung Park , Lois Orosa , Onur Mutlu

分类：人工智能 | 机器学习 | 神经与进化计算

2022-11-10

Resistive Random-Access Memory (RRAM) is well-suited to accelerate neural network (NN) workloads as RRAM-based Processing-in-Memory (PIM) architectures natively support highly-parallel multiply-accumulate (MAC) operations that form the backbone of most NN workloads. Unfortunately, NN workloads such as transformers require support for non-MAC operations (e.g., softmax) that RRAM cannot provide natively. Consequently, state-of-the-art works either integrate additional digital logic circuits to support the non-MAC operations or offload the non-MAC operations to CPU/GPU, resulting in significant performance and energy efficiency overheads due to data movement. In this work, we propose NEON, a novel compiler optimization to enable the end-to-end execution of the NN workload in RRAM. The key idea of NEON is to transform each non-MAC operation into a lightweight yet highly-accurate neural network. Utilizing neural networks to approximate the non-MAC operations provides two advantages: 1) We can exploit the key strength of RRAM, i.e., highly-parallel MAC operation, to flexibly and efficiently execute non-MAC operations in memory. 2) We can simplify RRAM's microarchitecture by eliminating the additional digital logic circuits while reducing the data movement overheads. Acceleration of the non-MAC operations in memory enables NEON to achieve a 2.28x speedup compared to an idealized digital logic-based RRAM. We analyze the trade-offs associated with the transformation and demonstrate feasible use cases for NEON across different substrates.

translated by 谷歌翻译

Using Unmanned Aerial Systems (UAS) for Assessing and Monitoring Fall Hazard Prevention Systems in High-rise Building Projects

Yimeng Li , Behzad Esmaeili , Masoud Gheisari , Jana Kosecka , Abbas Rashidi

分类：机器人

2022-09-27

这项研究开发了一个无人驾驶系统（UASS）的框架，以监测高层建筑项目中未受保护的边缘和开口附近的跌落危险系统。开发并测试了一个三步基于机器学习的框架，以检测UAS捕获的图像的护栏柱。首先，对护栏探测器进行了培训，以定位支撑护栏的职位的候选位置。由于从实际的工作现场收集的此过程中使用了图像，因此确定了几个错误检测。因此，在以下步骤中引入了其他约束，以滤除错误检测。其次，研究团队将水平线检测器应用于图像，以正确检测地板并删除离地板不近的检测。最后，由于每个帖子之间安装了护栏柱，它们之间的分布差异大致，因此它们之间的空间被估算并用于找到两个帖子之间最有可能的距离。研究团队使用了开发方法的各种组合来监视高层建筑项目的捕获图像中的护栏系统。比较精度和召回指标表明，级联分类器通过落地检测和护栏间距估计来取得更好的性能。研究结果表明，拟议的护栏识别系统可以改善护栏的评估，并促进安全工程师确定高层建筑项目中跌落危害的任务。

translated by 谷歌翻译

Mitigating shortage of labeled data using clustering-based active learning with diversity exploration

Xuyang Yan , Shabnam Nazmi , Biniam Gebru , Mohd Anwar , Abdollah Homaifar , Mrinmoy Sarkar , Kishor Datta Gupta

分类：机器学习 | 人工智能

2022-07-06

在本文中，我们提出了一个新的基于聚类的主动学习框架，即使用基于聚类的采样（ALCS）的主动学习，以解决标记数据的短缺。ALCS采用基于密度的聚类方法来探索数据集群结构，而无需详尽的参数调整。引入了基于双簇边界的样本查询过程，以提高对高度重叠类分类的学习绩效。此外，我们制定了一种有效的多样性探索策略，以解决查询样品之间的冗余。我们的实验结果证明了ALCS方法的疗效。

translated by 谷歌翻译

Automated Wheat Disease Detection using a ROS-based Autonomous Guided UAV

Behzad Safarijalal , Yousef Alborzi , Esmaeil Najafi

分类：机器人 | 人工智能

2022-06-30

随着世界人口的增加，必须修改粮食资源，以提高生产力，抵抗力和可靠性。小麦是世界上最重要的食品资源之一，主要是因为各种基于小麦的产品。小麦作物受到三种主要疾病的威胁，这些疾病会导致大量的农作物产量损害。这些疾病可以通过在正确的时间使用农药来消除。尽管手动喷洒农药的任务是繁重且昂贵的，但农业机器人技术可以通过提高速度和减少化学物质的量来帮助农民。在这项工作中，已经在无人驾驶飞机上实现了一个智能自主系统，以自动监测小麦田的任务。首先，一种基于图像的深度学习方法用于检测和分类感染了疾病的小麦植物。为了找到最佳方法，已经研究了不同的方法。由于缺乏公共小麦滴定数据集，因此已经创建了自定义数据集。其次，使用机器人操作系统和凉亭环境中的仿真提出了有效的映射和导航系统。 2D同时定位和映射算法用于借助基于边境的探索方法自动映射工作空间。

translated by 谷歌翻译

ScoreNet: Learning Non-Uniform Attention and Augmentation for Transformer-Based Histopathological Image Classification

Thomas Stegmüller , Behzad Bozorgtabar , Antoine Spahr , Jean-Philippe Thiran

分类：计算机视觉

2022-02-15

高分辨率图像和详尽的局部注释成本的过高成本阻碍了数字病理学的进展。用于对病理图像进行分类的常用范式是基于贴片的处理，该处理通常结合了多个实例学习（MIL）以汇总局部补丁级表示，从而得出图像级预测。尽管如此，诊断相关的区域只能占整个组织的一小部分，而当前基于MIL的方法通常会均匀地处理图像，从而丢弃相互作用的相互作用。为了减轻这些问题，我们提出了Scorenet，Scorenet是一种新的有效的变压器，利用可区分的建议阶段来提取区分图像区域并相应地专用计算资源。提出的变压器利用一些动态推荐的高分辨率区域的本地和全球关注，以有效的计算成本。我们通过利用图像的语义分布来指导数据混合并产生连贯的样品标签对，进一步介绍了一种新型的混合数据启发，即SCOREX。 SCOREMIX令人尴尬地简单，并减轻了先前的增强的陷阱，该增强性的陷阱假设了统一的语义分布，并冒着标签样品的风险。对血久毒素和曙红（H＆E）的三个乳腺癌组织学数据集（H＆E）的三个乳腺癌组织学数据集（H＆E）的彻底实验和消融研究验证了我们的方法优于先前的艺术，包括基于变压器的肿瘤区域（TORIS）分类的模型。与其他混合增强变体相比，配备了拟议的得分增强的Scorenet表现出更好的概括能力，并实现了新的最先进的结果（SOTA）结果，仅50％的数据。最后，Scorenet产生了高疗效，并且胜过SOTA有效变压器，即TransPath和SwintransFormer。

translated by 谷歌翻译

Deep Learning Applications for Lung Cancer Diagnosis: A systematic review

Hesamoddin Hosseini , Reza Monsefi , Shabnam Shadroo

分类：计算机视觉 | 机器学习

2022-01-01

肺癌近年来一直是最普遍的疾病之一。根据该领域的研究，每年在美国确定超过20万个案件。不受控制的繁殖和肺细胞的生长导致恶性肿瘤形成。最近，深入学习算法，特别是卷积神经网络（CNN），已成为自动诊断疾病的高级方式。本文的目的是审查不同的模型，导致诊断早期肺癌的不同准确性和敏感性，并帮助该领域的医生和研究人员。这项工作的主要目的是确定基于深度学习的肺癌存在的挑战。经过系统地编写了调查，这些调查结合了定期的映射和文献综述，从2016年到2021年审查该领域的32次会议和期刊文章。在分析和审查条款后，正在回答条款中提出的问题。由于对相关文章的完全审查和系统化，本领域，这项研究优于该领域的其他综述文章。

translated by 谷歌翻译

AI-supported Framework of Semi-Automatic Monoplotting for Monocular Oblique Visual Data Analysis

Behzad Golparvar , Ruo-Qian Wang

分类：计算机视觉 | 人工智能

2021-11-28

在过去的几十年中，智能手机，无人机，空中巡逻和数码相机的发展使大量人口的高质量照片，因此提供了利用全球覆盖率收集大自然和社会的大规模数据的机会。然而，用新的摄影工具收集的数据通常是倾斜的 - 它们难以理工学，大量数据通常已经过时。可以通过称为Monoplotting的技术来解决地理转移倾斜图像数据，该技术仅需要单个图像和数字高度模型（DEM）。在传统的单架中，人类用户必须在图像和DEM中手动选择一系列地面控制点（GCP）对，然后确定相机的外在和内在参数，以在照片和DEM之间建立像素级对应关系启用照片中对象的映射和地理位置。由于包括劳动密集型投入的几项挑战，这种传统方法难以规模，需要丰富的经验来识别明确定义的GCP，以及相机姿态估计的局限性。因此，现有的单幅形方法很少用于分析大规模数据库或近实时警告系统。在本文中，我们提出并展示了一种新型半自动单架框架，提供了需要最小的人类干预的照片和DEM之间的像素级对应。开发了一种分析管道，包括在图像和DEM栅格中的关键点检测，检索地理学的3D DEM GCP，正则化梯度基优化，姿势估计，射线跟踪以及图像像素和现实世界坐标之间的对应标识。两个数值实验表明，框架在3-D坐标中的地理转移视觉数据方面优异，铺平了朝向全自动单架的方法。

translated by 谷歌翻译